Improving Reinforcement Learning of an Obstacle Avoidance Behavior with Forbidden Sequences of Actions

نویسندگان

  • C. Touzet
  • N. Giambiasi
  • S. Sehad
چکیده

This paper is concerned with the improvement of reinforcement learning through the use of forbidden sequences of actions. A given reinforcement function can generate multiple effective behaviors. Each behavior is effective only considering the cumulative reward over time. It may not be the behavior expected by the designer. In this case, the usual solution is to modified the reinforcement function so as to introduce a new constraint related to the behavioral aspect to express. Alternatively, we propose not to modify the reinforcement function, but to add an external module containing generic forbidden sequences of actions. Experiments with the real miniature robot Khepera in a task of learning an obstacle avoidance allow to confirm the interest of this approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

Avoidance of Multiple Dynamic Obstacles

this article is a continuation of the previous article called “Obstacle Avoidance in Dynamic Environment: a Hierarchical Solution”, which presented the concept for obstacle avoidance in dynamic environment suitable for mobile robot. The task of obstacle avoidance is divided in three principal groups: local, global and for emergencies. The global avoidance is here approached, in which the concep...

متن کامل

Self-learning Fuzzy Navigation of Mobile Vehicle

This paper describes a self-learning navigation method which utilizes fuzzy logic and reinforcement learning for navigation of a mobile vehicle in uncertain environments. The proposed navigator consists of three modules: Obstacle Avoidance, Move to Goal and Fuzzy Behavior Supervisor. The fuzzy rules of the on-line obstacle avoidance are learnt through reinforcement learning. A new and powerful ...

متن کامل

Dynamic Obstacle Avoidance with PEARL: PrEference Appraisal Reinforcement Learning

Manual derivation of optimal robot motions for task completion is difficult, especially when a robot is required to balance its actions between opposing preferences. One solution has been to automatically learn near optimal motions with Reinforcement Learning (RL). This has been successful for several tasks including swing-free UAV flight, table tennis, and autonomous driving. However, high-dim...

متن کامل

Reactive Vision-Based Navigation Controller for Autonomous Mobile Agents

Initial results of an ongoing research in the field of reactive mobile autonomy are presented. The aim is to create a reactive obstacle avoidance method for mobile agent operating in dynamic, unstructured, and unpredictable environment. The method is inspired by the stimulus-response behavior of simple animals. An obstacle avoidance controller is developed that uses raw visual information of th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995